# Multilingual speech processing

Whisper Uz
Apache-2.0
Uzbek speech recognition model fine-tuned on Whisper Base, trained on the Common Voice dataset
Speech Recognition Transformers Other
W
jamshidahmadov
1,179
3
Wav2vec2 Large Xlsr 53 Tr Fine Tuning Deprecated
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice Turkish dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Transformers
W
bekirbakar
17
0
Wav2vec2 Large Xls R 300m Irish Colab Test
Apache-2.0
This is a speech recognition model fine-tuned on the Common Voice Irish dataset based on the facebook/wav2vec2-xls-r-300m model, primarily used for automatic speech recognition tasks in Irish.
Speech Recognition Transformers
W
jfealko
24
0
Output
This model is an automatic speech recognition model fine-tuned on the Abkhaz language dataset, based on the XLS-R architecture
Speech Recognition Transformers Other
O
deepdml
25
0
Wav2vec2 Large Xlsr Rm Sursilv
Apache-2.0
This is an automatic speech recognition model fine-tuned from the facebook/wav2vec2-large-xlsr-53 model, specifically designed for recognizing the Sursilvan dialect of Romansh.
Speech Recognition
W
gchhablani
27
0
Wav2vec2 Large Xlsr Slovene
Apache-2.0
This is a Slovenian speech recognition model fine-tuned from Facebook's wav2vec2-large-xlsr-53 model, trained using the Common Voice dataset.
Speech Recognition Other
W
mrshu
23
2
Wav2vec2 Large Xls R 300m Welsh
Apache-2.0
This is an automatic speech recognition model fine-tuned on the Welsh dataset based on facebook/wav2vec2-xls-r-300m, achieving a word error rate of 31.003% and a character error rate of 7.775% on the Common Voice 7 Welsh test set.
Speech Recognition Transformers Other
W
infinitejoy
89
0
Xls R Et V 3
Apache-2.0
This model is an automatic speech recognition model fine-tuned on Estonian datasets based on facebook/wav2vec2-xls-r-1b
Speech Recognition Transformers Other
X
vasilis
41
0
Wav2vec2 Base 10k Voxpopuli Ft Hr
A speech recognition model based on Facebook's Wav2Vec2 architecture, pretrained on the VoxPopuli corpus and fine-tuned on Croatian data
Speech Recognition Transformers Other
W
facebook
20
0
Wav2vec2 Large West Germanic Voxpopuli V2
Facebook's Wav2Vec2 large model, pretrained exclusively on 66.3 hours of unlabeled data from the West Germanic VoxPopuli corpus.
Speech Recognition Transformers
W
facebook
25
1
Wav2vec2 Large El Voxpopuli V2
Greek speech recognition model pretrained on VoxPopuli corpus using 17.7 hours of unlabeled data
Speech Recognition Transformers Other
W
facebook
24
0
Wav2vec2 Large North Germanic Voxpopuli V2
Large speech model pre-trained on North Germanic language corpus from VoxPopuli
Speech Recognition Transformers
W
facebook
25
0
Wav2vec2 Xls R 300m Turkish Tr Med
Apache-2.0
This model is a Turkish speech recognition model fine-tuned on common speech datasets based on facebook/wav2vec2-xls-r-300m
Speech Recognition Transformers
W
emre
22
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase